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This paper discusses an Vues gation that compared 


college students reading qains when using general cr specific 
diagnosis. It reports that siqnificant gains.were made.on many 
‘subskills of the Stanford Diagnostic Reading Test by the group that 
had the prescriptions developed as a result of a differential 
(specific) diagnosis and that there were no’ significant gains noted 
on the Nelson-Denny Reading Test, a general survey test. An analysis 


of two other research studies on this topic shows that results of 


‘3 


this study support the results of a study by G. L. Cox and J. E. 
Swalm and contradict the results of a study by B. D. ‘Smith. It is 
noted, however, that since Smith used the Nelson-Denny Reading Test 
to test gains from specific diaqnosis, her findings are nct 

. necessarily valid and not necessarily contradic@tory to those of the 
other two studies. The evidence in the literature indicating that the 
Nelson-Denny Reading Test’ is not an appropriate test for evaluating 

_ the reading ability of colleae students in developmental rrograms and 
. that ‘survey tests’ are often used inappropriately as measurements of 
achievement in college reading proqrams is discussed in the light of 
‘the findings from these research reports. (MKM) ° 
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of Reading (Smith, 1977). The research investigation (Flippo, 1979) 


. gains when using a survey or less specific diagnosis invalidates Smith's 
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A Proposal: The Need for Comparison Studies of College Students’ Reading Gains 
in Developmental Reading Programs Using-General and Specific:Levels of Diagnosis © 


a 


Rona F. Flippo 


This article is a rebuttal to an article published by the Journal 


exploring a comparison of college students’ reading gains when using 
— 


differential or more specific diagnosis and college students’ reading 


findings. Significant ‘gains were made in many subskill's by the: group 


that had their’ prescriptions developed as a result ¢f. the differential 


diagnosis. : 
‘' A thorough review of the literature from 1930 through 1978 re- 
: : 


veals only two research studies that explored thie type of compari- 


- 


son (Cox:.and Swalm, 1973; Smith, 1974). In her 1977 article Smith 
never mentions, the Cox and Swalm study. Evidently, at the time Smith. 
(1974) completed her dissertation, fromwhich she wrote the article 

in question (Smith, 1977), the National Reading Conference Yearbook, 
which contatned the ere of the Cox and Swalm (1973) study, had 


not been available. Although e Cox and Swalm study and the Smith 


study are the beginnings: of a bo of research in this area, they. 


~ 


both have their limitations in the relatively’ small sizes ofvtheir 


groups, Cox and Swalm (n=98) and Smith (n=75). Additionally, the 
be 
findings of the two studfes are contradictory to each other;- how- 


ever, the validity of the Smith study has been severely limited 


< ; 
by the..research design of the study because it did not allow for the 
: s , . "y 
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posttesting of the groups on all, the tests with which they were pre- 
tested and because Smith, the ricear chee See also the instructor of 
both groups. i : . 
et 2 . 

The purpose of: the Cox and Swalm (1973): study was to determine 
if form “diagnosis before ‘instruction is beneficial as compared to 
infor al didgnosia during instruction; however, because of the nature 
of their stad thas also compared formal or differential initial di- 
“agnosis to a more general pr survey-type initial diagnosis. High-risk 
college: freshmen at Rutgers Wiieeewday when eakanbd with the McGraw= . 
Hill Reading Test and the comprehension section of the Nelson-Denny 
Reading Test And randowly assigned to two groups. thie Weudenti in both 
groups were taught by eight a¢Prerent tnsteudtoxa. Methods -and course 
content were the same for the groups, except for the amount of diagno- 
sis each received from test: results. One group was prescribed after an 
analysis of specific test items on NUbeEveas of thie McGraw-Hill, or af- 
ter differential deapiaate. The other, group was prescribed on the over- 
wie sakes in each of the three major sub-areas of the McGraw-Hill. Both 
groups were posttested at the conclusion of the term with another form 
of the same tests. Students receiving ete more differential initial- di- 
agnosis scored significantly wishes at nee .05 level than students in 
ths other group on the McGraw-Hill and the compreHension section of ‘the 
.Nelson-Denny reading tests. The results of the Cox and Swalm study 
indicate that instruction based upon the diagnosis of specific reading— 
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strengths and weaknesses is superior to skill instruction based_on 


general evaluation of students' reading abilities. . 2 oi 
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, Smith Ca) extuined he value of additional diagnosis. in com- 
pfehension. aaa BUR Ee in a invelopubital reading program at es 
Junior College. ‘The students were pretested with the Nelson-Denny 
Reading Test, Form A. ‘the Van Wagénen Reading beaten, and the calif- 
-ornia Phonics Survey, and, randomly assigned to two groups, both taught 
by Smith. Reading instruction for one group was based oth the diagnos- 
tic information gained fro all three teste. Reading instrudtion for 


the other group was based on the survey results of the Nelson-Denny. 


The only MeL ESrenee between. groups was the amount of reading diagnosis 


each received from the pretests. The Nelson-Denny Reading Test, Form 


B, was the only test administered to both groups as a posttest at the 


end of the term. An analysis of covariance showed no significant dif- 
ference at the .05 level in Sie Azo sali between the groups on ‘tie 
Nelson-Denny; however, the group receiving the differential diagnosis 
did show an observed gain higher than the group having just the survey 
diagnosis. ‘The results of Smith's study, using the Nelson-Denny data, 
indicate that instruction based on sidtiaeal gaaionts beyond the sur- 
vey test is not more successful than instruction based on survey test 
Gi mee: one The findings of Smith (1974) appear to be contradictory 
to the findings of Cox and Swalm (1973). But a third study (Flippo, 
1979) indicates that the smith findings are not necessarily valid and 


z ‘ 
not necessarily contradictory to those of Cox and Swalm. 


» “A Third Study 


» 


A third study (Flippo, 1979) was designed to compare the effects 
of specific and general diagnosis on postsecondary students' progress 


. & y 7 a 
in selected reading subskills after developmental instruction. In-~~.— 
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struction was’ presprdbed, fron the specific or ii Fteeenti at results of 
a a diagnostic reading test for some students and hick the ‘general and. 
| more traditionally used results of a survey reading test for other 
ebudentes The selected wghnttiia qaxe literal compfebewsion, infer- 
‘ential constaneualin: word meaning, word parts, phonetic analysis,, 
structural analysis, scanning and skimming / and fast fend tans as meas- 
ured by the Stanford Diagnostic Reading Test (SDRT), Blue Level (1976), 
and vocabulary and comprehension, as measured by the Nelson-Denny Read- . 
tng Test (NDRT), Forms c and D (1973).. - ge OS 


The Nelson-Denny was selected because it had been used in both 


the Cox and Swalm, and Smith studies and because surveys had indicated 


that the eledavpenty is the test most widely used in developmental 
eutiaes reading programs (Lowe and Stefurak, 1970; Goodwin, 1971; 
Sweiger, 1972; Landsman and Cranny, 1978). The rate subtest of the 
Nelson-Denny was used in writing prescriptions for the control group 
ae traditionally done, but not as a daveadent variable in this etudy. 


- 


Farr (1968) criticizes the rate subtest of the NDRT for lacking con- 
, -& t 


vergent,and discriminant validity, and he. suggests it should not be ™ 


used at all. 
The Stanford Diagnostic, Blue’ Level, was developed to meet the 


\ 
reading evaluation needs of grade 9 through junior/community* college 


students. Van Roekel (1978), in his review of the newest edftions of 


the SDRT, stated, "The SDRT has few peers among group diagnostic read-' 


¢ 


ing Peet se Apel 292 he This test met the needs and suggestions indi- 
i] 
cated for college developmental /reading by Ironside (1969), Tittle 


and Kay (1971), and Evans and Dubois (1972). It was the best diag- 
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De aw ee ig 
nostic instrument available for this population at the time of the. 
study. _ : , “eas 7 2, : 


» 
‘ 


The sample consisted of, approximately 450 entering freshman stu- 
dents enrolled in fifteen different sections of "Effective Reading" 


in the 1978 fall semester at the University: of. South Carolina's Coi+ 


° - 


lege of General Studies, Columbia. The résearchet's classes and evan-| 


ing classes were not included in the study., These fifteen sections 


. 


were taught by nine different instructors; students were randomly a > 


assigned to an, experimental (x) or control (x) group so that there 


was approximately an equal number of each group in each section., The’ 
i a . £ : o 
fréshmen were required to take "Effective Reading" because they scored 


390 or below on the verbal portion of the SAT. The actual number of 


students included. in the study from the original group of approximately 


450 were the 226 students (x=112, x=114) who completed two counseling- 
° . ’ . P P 3 ‘ ° 
prescription appointments, a minimum of 10 lab hours in the Develop- 


mental Center, and all pre and posttests on the SDRT and NDRT. 


On the basis of information from the subtests of the diagnostic 
i 7 4 : ba , . . 
instrument (SDRT)', individual prescriptions for reading skill improve- 


ment were written for the students in the experimental group. Only 


X : ? \ 
the- information from the subtests of the survey instrument .(NDRT) were 
lal ‘ 


used in writing prescriptiogs for the control group. The original diag- 


noses from the pretest information entthe students in both groups were 
com 
not altered in any way throughout remediation. All students were treat- 


” e ‘ 
ed with two hours per week of ,classroom instruction with their instruc- 


. . * . ; 
tor and one hour per week of lab instruction. Lab instruction was based 


Sp) 


on each student's individual ‘prescription. Students in both ve con- 
trol and cnpetinanial groups hid, the same treatment exept for i 
amount of diagnostic or differential information available for the 


‘initial counseling and prescription of instruction. 


Students were not aware of the ai ffereatiation of diagnosis oe 
the control and experimental groups. All the students vere pretestdd. 
.with both the diagnostic (SDRT) and survey CHET) instruments. Stu: | 
“dents in the experimental group had. all of ‘their tests handscored a d 
individual profiles were made Batael raw scores and stained: from the., 
SDRT and raw scores and grade equivalents ‘from the NDRT. Students 
‘the~tontrol group only had their NDRT scored, and individual. profiles 
were made from only the NDRT data. The SDRT's for oGeEL weadent } 
were left unscored and. locked in a file cabinet until the end of the 
term. . : ji | ww 

The diagnosis and prescription of the two groups ‘were accomplished 


by use of a card system designed by ene researcher erepelatay for the 
| 


testing ‘instruments and materials euieetes for: this study-, (Flippp, 1980). 


The card system was programmed by thg researcher saealopiis all/the lab 


materials into one or more of the ten, selected subskills foy this study, 
& ? 


plus the subskill of rate from the NDRP. Graduate assistants were’ 


trained to use ‘thas card .system to nae appropriate decisions regard- 
ing the prescription of materials according to the pretest results when 
‘sadneatind atadenes in both groups. Instructors were given the pretest 
results used for ere es and prescription, with the more specific 


diagnostic information on each of their experimental students and only 


the more general survey test information on their control students. 


“I 


. 


At the end of the semester, all students were posttested /with 
different forms of the SDRT and NDRT. The control group's SPRT pretests 
were then scored and the raw score data.recorded., The gain in the ten 


! 


selegeed subskills was measured by the pretest ane. poetkedt differences 
of the two groups. . se “pool test was used.at chs 05 level of signif- 
icance, . \ 
os ; = Ne Findings 

There was a stat tetically ‘significant Abttetetce in “the means of 
the pain sqBres' ¢ of fhe al a heal or Ate hetentte + Ay diagnosed stu- 
dents‘in the pulerslee of Mteral comprelicitsion, inferential compre- 
hehsion, vord: meaning, phonetic anatyaie, structural analysts, and 
scanning and ahinaitng: There was not a tatistically viitpioant di f- 
- ference between the means of the isin es of -the expergmental stu- «4 


dents ana the control students in the subskills of word parts, fast 


reading, vocabulary, and compreherision. However, on the subskills of 


fast reading and vocabulary, the observed gain of the experimental a, 
“eae was substantially maenee than that of the control group, but ,not 
ssend Sloan key higher at the .05 level. see summary table dnewing 
dcmpabtoen of raw score gains on each of ane ten selected subskills.) 
This:study's Findings generally support the sone ludene — \ 
and Swalm (1973) that postsecondary students do better sin developmental 
reading programs that utilize more specific op differential: diagnosis 
and prescription than in reading programs using aces general testing 
of jasaine abilities. However, in this ae as’in the Smith (1974) 
‘study, the gains from the Nelson-Denny (voeabulaty and comprehension) 


did not reflect evidence of a significant difference. If this study 


Q 


. | ae: a oe ; 


had solely used the Nelson-Denny to. measure the gains, as Smith did, 


the findings would have magi cada the en cdutone of Smith rather. 
than those of Cox and Sekine : iy : : Pee 
a 


<2 


> F a e 4 
: Gains made as a FeeUee of diagnostic or, differential prescription” 


may not necessarily be reflected in nv neues of Survey reading tests. 


- 


. Smith's study dows not willow: foe thie possibility.” If Smith would have 


given the iteaiesksc: tests as posttests as- vell as pECue tes to ‘both: 
us 
her groups and compared those results Peeeeen groups, she might, have ’ 


4 


found ‘that the BEOup with the 4i ferential diagnosis did better as meas— 
uted by Ete dteenoatte tests. incorkinacedys this is not ,-known; ‘there- 


fore, ee Smith study cannot be considered valid. 


‘ 
a . 


These lei suggest that. ‘the cael penny Reading Test, used 


more than any athet in college’ reading. programs, is not consistently 


a good discriminator of gueteaks that have improved more and students 
that have eee Ses less in atte weeeeal PESErARes In this study CFLAppo, ° 


a 
1979) the gaing on the subeksts of comprehension, as “measured. by. the * 


~?* 
. 


Nelson-Denny, do not reflett the gains made on'the subskills of literal 
and inferential Keenceper ddan as measured by the Stanford Diagnostic. 
The t values of literal and inferential comprehension indicate that 
there was a‘significant difference, between’ tte-gains of the groups;— 
yet, there was Nag eit onttieare or observed difference between the caaas 


of the - -groups on comprehension, as measured only by the NelSon- Denny. 


Additionally, the gaing ade on the subskill of vocabulary, as meas- 


ured by the ‘Nelson- “Denny, do not significantly reflect the gains made 
ne ¢ 
on. the subskills of OER suk Die and literal comprehension (vocabulary 


VL 
in context), as aeacueed! by the Stanford Diagnostic. ft’ Keele be 


noted that both "groups improved to approximately the same degree on 


- . 
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the ,subskill of word parts. There is no clear explanation for this 
” f 


! nhendaeddits and Furthes study is needed in this area. 


ae oe Misuse of Test = 
There has, been evidence in the Literature which indicates that 

‘the Nelson- sientiy might not.-be ancappropriate test for evaluating the . 

“reading ability of ‘college students in developmental programs and that 

Survey tests are often used EvePPrOREABtely as a measurement of achieve- ; 


ment in college reading programs.’ Murphy and Davis (949) ‘found that 


scores on the Nelson-Denny could be greatly increased when students, 


_ were encouraged” to mark every item even if they did not know the ans- 


‘wers to the items. This would cause an “inflation of gain scores tenn 


‘ 


pre to posttesting without. any remediation at” all. Orr (1965) found 
the comprehension passages of the Nelson-Denny to be long, involved, 


and grammatically complex.’ . Townsend (1965) suggested that the Heteon- - 

* Denny was a challenging academic type of test useful for wevestiny good ( 
readers going into the liberal arts but would not adequately differen- 
‘tiate réading skills of college students. Farr (1968) pointed uk that 
the rate section of the Nelson-Denny lacks both convergent and discrim~ 
inant validity. 

Tronside (1969) and Evans and, Dubois (1972) cautioned against 
using survey ‘tests for the nieious of assessing achigvement in reading 
or for making a diagnosis. “ Goodwin (1971) found that 60% of the 300 
Siiteeteolbepe reading teachers he surveyed considered dtagnostic the 
survey test riven at the beginning of the term as a screening device,’ 


and the Nelson-Denny was the most widely used. Chester, Dulin, and 


Carvell. (1975) suggested that hana should be made in the noaprehen= 


>: ieee ' 
~ io | | ( 


—— 
ee 


Ev 
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‘materials... 


sion questions of the Nelson-Denny. : They suggested semantic and*syn- 


° . *: 5 i 


| | 
tactic simplification of stems and roots; ‘specificity and clarity 1 
qhest tons that served as a way ‘of identifying information abdut ho 


vant the students # duanand’ the stimudus material being ERSEET: no 


how well ‘ey eould answer certain types of questions; Pee accented’ 


that students would actually need to ask and to answer for themselves 


outside of the testing situation in order to understand the reading 


’ 


‘The literature seems to adieies that cis Ne Lson-Denny is eels 
vey test nat is suitable for Seterning academically able college-bound 3 
atudents., It does not adequately sample the abilities of vocational- ¢ 
education students or postsecondary ‘students in need of remediation 
“programs. This test is not suitable. for the Seeunnere college/develop- 
enieai education market for screening purposes and certainly not for 
aeiaaive achievement in reading in any: college setting. It seems that — 
shia wetetoal flaw of the Smith study and ‘article ts the misuse of this 
test. 'This misuse is chronic in the literature and research relative 
to womecuaa yaa developmental reading programs. 
Recommendat fons 3 . , 
Based on the stat Gationt findings and sdccaetenat implications of — 
this study and the éotnten of this researcher, the following recom- 
aenaatiane are made for further research daa aunetiaeaedonls- Additional 


research investigating a comparison of specific and general levels of 


4 . ~ 
. 


es 


ag ee in college readine programs is neeted. tee Nelson-Denny 

« \ F ‘ ; %. 

should not be ased for purposes beyond that of screening developmental 
reading students and should be restricted for screéning if there is ac- 


cess to 4 more anpronriite hagh, far ents qanrilathon:. Mora dfiagrdostte 


11. 


reading tests designed specifically ‘for measuring the reading and. 


‘ 


_ study skills of postsecondary studerit’s would-be. destrable, and dis~- 


semination of information to community college/developmental reading. 


practitioners on phe presently available tests is gfavely necessary. 
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Number 
umber 


duaawes Table 
Results ~s t-Pool Test Comparing’R 


-05) = 1.960 
Hein” stanteod 
Gain Deviation 
Vv F 
3.5.17 2.910 
1.578 4.030 
2.857 2 667° 
“614, x 3. 385 
2.375 2.812 ~ 
-096 3« 753 
2.357 34221, ° 
2.596 3.422 
2 205 3.649 
tarde 

Zua17 4.024° "4 
1.447 3.5406 
3.473 3,193 

061 4.668": 


m. Score Gains 


“Standard / tt: 
Error "_f - ‘value 


Fi 
- 


274, 
. = 4.140 
ig Stl <-. 
4252 
* 5.526 
: -317 Aa c 2 
.265- ~ ‘ 
. 54594 
e951 
304 
; £541 
5320—Ct ; 
344 - 
4.394 
342 ; 
.380 a 
24124 
waa = fe ’ 
301: 
6.401 
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Significant 
Difference 
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no 


yes 


yes 
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? Sunmary Table 
; . Results of t- Pool Test /Comparing Rew Score Gains 
4 eo, . HARES -05) = Rs -960 
° ees Continued 


Mean Piaece » Standard 


Subskills/Test- Group "Number ™ Gain | Deviation Error 
Fast -  Bxpekdmental - 112 °° 3.133: 3.588 * 339 
= Reading. : ‘ . a: Ay os : 
(SDRT) = * °° Conttol’ |. 12a: 2.368. 3.956 .370 ae 
” , Se Car : | et § i ee 
Vocabulary. ‘ "Experimental 112 “4.526 3.815. "  .360 


_ (NRT) v ‘Control .  -. 114° - | 3.675) 4.062 380 


hee & * 


‘Comprehension’ - Experimental “12 . 5.833} *4,699 hb. 
£ P| ‘ * of ‘ P of ‘ 3 
(NDRTY ©" + Control 1m |i yp 5.842 Aago 589 
@ | i . 
| ¢ 
3 
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